Hermite polynomials

In mathematics, the Hermite polynomials are a classical orthogonal polynomial sequence that arises in probability, such as the Edgeworth series; in combinatorics, as an example of an Appell sequence, obeying the umbral calculus; and in physics, where they give rise to the eigenstates of the quantum harmonic oscillator. They are named in honor of Charles Hermite.

Definition

The Hermite polynomials are defined either by

(1)\ \ H_n(x)=(-1)^n e^{x^2/2}\frac{d^n}{dx^n}e^{-x^2/2}\,\!

(the "probabilists' Hermite polynomials"), or sometimes by

(2)\ \ H_n(x)=(-1)^n e^{x^2}\frac{d^n}{dx^n}e^{-x^2}=e^{x^2/2}\bigg (x-\frac{d}{dx} \bigg )^n e^{-x^2/2}\,\!

(the "physicists' Hermite polynomials"). These two definitions are not exactly equivalent; either is a rescaling of the other, to wit

H_n^\mathrm{phys}(x) = 2^{n/2}H_n^\mathrm{prob}(\sqrt{2}\,x).\,\!

These are Hermite polynomial sequences of different variances; see the material on variances below.
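As a quick check of the rescaling, here is a worked case for n = 2, using H_2^prob(x) = x^2 − 1 and H_2^phys(x) = 4x^2 − 2 from the lists of polynomials in this section:

```latex
2^{2/2}\,H_2^\mathrm{prob}(\sqrt{2}\,x)
  = 2\left((\sqrt{2}\,x)^2 - 1\right)
  = 4x^2 - 2
  = H_2^\mathrm{phys}(x).
```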

Below, we usually follow the first convention. That convention is often preferred by probabilists because

\frac{1}{\sqrt{2\pi}}e^{-x^2/2}

is the probability density function for the normal distribution with expected value 0 and standard deviation 1.

[Figure: The first five (probabilists') Hermite polynomials.]

The first eleven probabilists' Hermite polynomials are:

H_0(x)=1\,
H_1(x)=x\,
H_2(x)=x^2-1\,
H_3(x)=x^3-3x\,
H_4(x)=x^4-6x^2+3\,
H_5(x)=x^5-10x^3+15x\,
H_6(x)=x^6-15x^4+45x^2-15\,
H_7(x)=x^7-21x^5+105x^3-105x\,
H_8(x)=x^8-28x^6+210x^4-420x^2+105\,
H_9(x)=x^9-36x^7+378x^5-1260x^3+945x\,
H_{10}(x)=x^{10}-45x^8+630x^6-3150x^4+4725x^2-945\,

[Figure: The first five (physicists') Hermite polynomials.]

and the first eleven physicists' Hermite polynomials are:

H_0(x)=1\,
H_1(x)=2x\,
H_2(x)=4x^2-2\,
H_3(x)=8x^3-12x\,
H_4(x)=16x^4-48x^2+12\,
H_5(x)=32x^5-160x^3+120x\,
H_6(x)=64x^6-480x^4+720x^2-120\,
H_7(x)=128x^7-1344x^5+3360x^3-1680x\,
H_8(x)=256x^8-3584x^6+13440x^4-13440x^2+1680\,
H_9(x)=512x^9-9216x^7+48384x^5-80640x^3+30240x\,
H_{10}(x)=1024x^{10}-23040x^8+161280x^6-403200x^4+302400x^2-30240\,

Properties

H_n is a polynomial of degree n. The probabilists' version has leading coefficient 1, while the physicists' version has leading coefficient 2^n.

Orthogonality

H_n(x) is an nth-degree polynomial for n = 0, 1, 2, 3, .... These polynomials are orthogonal with respect to the weight function (measure)

w(x) = \mathrm{e}^{-x^2/2}\,\!   (probabilist)

or

w(x) = \mathrm{e}^{-x^2}\,\!   (physicist)

i.e., we have

\int_{-\infty}^\infty H_m(x) H_n(x)\, w(x) \, \mathrm{d}x = 0

when m ≠ n. Furthermore,

\int_{-\infty}^\infty H_n(x) H_n(x)\, \mathrm{e}^{-x^2/2} \, \mathrm{d}x = n! \, \sqrt{2 \pi}   (probabilist)

or

\int_{-\infty}^\infty H_n(x) H_n(x)\, \mathrm{e}^{-x^2}\, \mathrm{d}x = n! \, 2^n \sqrt{\pi}   (physicist).

The probabilist polynomials are thus orthogonal with respect to the standard normal probability density function.
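The probabilists' orthogonality relation can be checked exactly in integer arithmetic. The sketch below is only an illustration: the function names are made up here, the coefficients are built with the three-term recurrence given under "Recursion relation", and it relies on the standard fact (not stated above) that the kth moment of the standard normal distribution is 0 for odd k and (k − 1)!! for even k. The weight is divided by √(2π), so the expected value on the diagonal is n!.

```python
from math import factorial

def hermite_e_coeffs(n):
    """Coefficients of the probabilists' H_n, lowest degree first."""
    prev, curr = [1], [0, 1]             # H_0 = 1, H_1 = x
    if n == 0:
        return prev
    for k in range(1, n):
        nxt = [0] + curr                 # x * H_k
        for i, c in enumerate(prev):
            nxt[i] -= k * c              # - k * H_{k-1}
        prev, curr = curr, nxt
    return curr

def gaussian_moment(k):
    """E[Z^k] for Z ~ N(0, 1): 0 for odd k, (k-1)!! for even k."""
    if k % 2:
        return 0
    result = 1
    for j in range(1, k, 2):
        result *= j
    return result

def inner_product(m, n):
    """(1/sqrt(2*pi)) * integral of H_m(x) H_n(x) exp(-x^2/2) dx, exactly."""
    a, b = hermite_e_coeffs(m), hermite_e_coeffs(n)
    return sum(ai * bj * gaussian_moment(i + j)
               for i, ai in enumerate(a)
               for j, bj in enumerate(b))
```

For instance, inner_product(2, 2) evaluates to 2 = 2!, and inner_product(1, 3) evaluates to 0.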

Completeness

The Hermite polynomials (probabilist or physicist) form an orthogonal basis of the Hilbert space of functions satisfying

\int_{-\infty}^\infty\left|f(x)\right|^2\, w(x) \, \mathrm{d}x <\infty,

in which the inner product is given by the integral including the Gaussian weight function w(x) defined in the preceding section,

\langle f,g\rangle=\int_{-\infty}^\infty f(x)\overline{g(x)}\, w(x) \, \mathrm{d}x.

An orthogonal basis for L^2(R, w(x) dx) is a complete orthogonal system. For an orthogonal system, completeness is equivalent to the fact that the 0 function is the only function ƒ ∈ L^2(R, w(x) dx) orthogonal to all functions in the system. Since the linear span of the Hermite polynomials is the space of all polynomials, one has to show (in the physicists' case) that if ƒ satisfies

\int_{-\infty}^\infty f(x) x^n \mathrm{e}^{- x^2} \, \mathrm{d}x = 0

for every n ≥ 0, then ƒ = 0. One way to do this is to observe that the entire function

F(z) = \int_{-\infty}^\infty f(x) \, \mathrm{e}^{z x - x^2} \, \mathrm{d}x = \sum_{n=0}^\infty \frac{z^n}{n!}\int f(x) x^n \mathrm{e}^{- x^2} \, \mathrm{d}x = 0

vanishes identically. The fact that F(it) = 0 for every real t means that the Fourier transform of ƒ(x) exp(−x^2) is 0, hence ƒ is 0 almost everywhere. Variants of the above completeness proof apply to other weights with exponential decay. In the Hermite case, it is also possible to prove an explicit identity that implies completeness (see "Completeness relation" below).

An equivalent formulation of the fact that the Hermite polynomials are an orthogonal basis for L^2(R, w(x) dx) consists in introducing Hermite functions (see below), and in saying that the Hermite functions are an orthonormal basis for L^2(R).

Hermite's differential equation

The probabilists' Hermite polynomials are solutions of the differential equation

(e^{-x^2/2}u')' + \lambda e^{-x^2/2}u = 0

where λ is a constant, with the boundary condition that u should be polynomially bounded at infinity. With this boundary condition, the equation has solutions only if λ is a non-negative integer, and up to an overall scaling, the solution is uniquely given by u(x) = H_λ(x). Rewriting the differential equation as an eigenvalue problem

L[u] = u'' - x u' = -\lambda u

solutions are the eigenfunctions of the differential operator L. This eigenvalue problem is called the Hermite equation, although the term is also used for the closely related equation

u'' - 2xu'=-2\lambda u

whose solutions are the physicists' Hermite polynomials.

With more general boundary conditions, the Hermite polynomials can be generalized to obtain more general analytic functions H_λ(z) for λ a complex index. An explicit formula can be given in terms of a contour integral (Courant & Hilbert 1953).

Recursion relation

The sequence of Hermite polynomials also satisfies the recursion

H_{n+1}(x)=xH_n(x)-H_n'(x).\,\! (probabilist)
H_{n+1}(x)=2 xH_n(x)-H_n'(x).\,\! (physicist)

The Hermite polynomials constitute an Appell sequence, i.e., they are a polynomial sequence satisfying the identity

H_n'(x)=nH_{n-1}(x),\,\! (probabilist)
H_n'(x)=2nH_{n-1}(x),\,\! (physicist)

or equivalently,

H_n(x+y)=\sum_{k=0}^n{n \choose k}x^k H_{n-k}(y) (probabilist)
H_n(x+y)=\sum_{k=0}^n{n \choose k}H_{k}(x) (2y)^{(n-k)} (physicist)

(the equivalence of these last two identities may not be obvious, but its proof is a routine exercise).

It follows that the Hermite polynomials also satisfy the recurrence relation

H_{n+1}(x)=xH_n(x)-nH_{n-1}(x),\,\! (probabilist)
H_{n+1}(x)=2xH_n(x)-2nH_{n-1}(x).\,\! (physicist)

These last relations, together with the initial polynomials H_0(x) and H_1(x), can be used in practice to compute the polynomials quickly.
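A minimal sketch of that computation, assuming coefficient lists stored lowest degree first (the function name and the `physicists` flag are illustrative choices, not part of any library):

```python
def hermite_coeffs(n, physicists=False):
    """Return the integer coefficients of H_n, lowest degree first."""
    scale = 2 if physicists else 1
    prev, curr = [1], [0, scale]          # H_0 = 1, H_1 = x or 2x
    if n == 0:
        return prev
    for k in range(1, n):
        # H_{k+1}(x) = scale * x * H_k(x) - scale * k * H_{k-1}(x)
        nxt = [0] + [scale * c for c in curr]
        for i, c in enumerate(prev):
            nxt[i] -= scale * k * c
        prev, curr = curr, nxt
    return curr
```

Calling hermite_coeffs(4) gives the coefficients of x^4 − 6x^2 + 3, and hermite_coeffs(5, physicists=True) gives those of 32x^5 − 160x^3 + 120x, matching the lists in the Definition section.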

Explicit expression

The physicists' Hermite polynomials can be written explicitly as

 H_n(x) = n! \sum_{\ell = 0}^{n/2} \frac{(-1)^{n/2 - \ell}}{(2\ell)! (n/2 - \ell)!} (2x)^{2\ell}

for even values of n and

 H_n(x) = n! \sum_{\ell = 0}^{(n-1)/2} \frac{(-1)^{(n-1)/2 - \ell}}{(2\ell + 1)! ((n-1)/2 - \ell)!} (2x)^{2\ell + 1}

for odd values of n. These two equations may be combined into one using the floor function:

 H_n(x) = n! \sum_{m=0}^{\lfloor n/2 \rfloor} \frac{(-1)^m}{m!(n - 2m)!} (2x)^{n - 2m}.

The probabilists' Hermite polynomials have similar formulas, which may be obtained from these by replacing the power of 2x with the corresponding power of (√2)x, and multiplying the entire sum by 2^{−n/2}.
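The floor-function sum for the physicists' polynomials translates directly into code. The following is a sketch with an illustrative function name; it uses the fact that n!/(m!(n − 2m)!) is always an integer, so integer division is exact.

```python
from math import factorial

def hermite_phys_explicit(n):
    """Coefficients of the physicists' H_n (lowest degree first) from the explicit sum."""
    coeffs = [0] * (n + 1)
    for m in range(n // 2 + 1):
        # Term m of the sum: n! * (-1)^m / (m! (n-2m)!) * (2x)^(n-2m)
        numerator = factorial(n) * (-1) ** m * 2 ** (n - 2 * m)
        coeffs[n - 2 * m] = numerator // (factorial(m) * factorial(n - 2 * m))
    return coeffs
```

For n = 4 this returns the coefficients of 16x^4 − 48x^2 + 12.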

Generating function

The Hermite polynomials are given by the exponential generating function

\exp (xt-t^2/2) = \sum_{n=0}^\infty H_n(x) \frac {t^n}{n!}\,\! (probabilist)
\exp (2xt-t^2) = \sum_{n=0}^\infty H_n(x) \frac {t^n}{n!}\,\! (physicist).

This equality is valid for all complex x and t, and can be obtained by writing the Taylor expansion at x of the entire function z → exp(−z^2) (in the physicists' case). One can also derive the (physicists') generating function by using Cauchy's integral formula to write the Hermite polynomials as

H_n(x)=(-1)^n e^{x^2}\frac{d^n}{dx^n}e^{-x^2}= (-1)^n e^{x^2}{n! \over 2\pi i} \oint_\gamma {e^{-z^2} \over (z-x)^{n+1}}\, dz.\,\!

Using this in the sum \sum_{n=0}^\infty H_n(x) \frac {t^n}{n!}\,\!, one can evaluate the remaining integral using the calculus of residues and arrive at the desired generating function.
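The probabilists' generating function can be sanity-checked numerically by comparing a truncated series with exp(xt − t^2/2). This is only a floating-point sketch; the function name and the default of 40 terms are arbitrary choices, and the values H_n(x) come from the three-term recurrence stated under "Recursion relation".

```python
import math

def generating_partial_sum(x, t, terms=40):
    """Sum of H_n(x) t^n / n! for n < terms (probabilists' polynomials)."""
    h_prev, h_curr = 1.0, x               # H_0(x), H_1(x)
    total = h_prev + h_curr * t
    t_pow_over_fact = t                   # t^n / n! for n = 1
    for n in range(1, terms - 1):
        # Advance to H_{n+1}(x) = x H_n(x) - n H_{n-1}(x)
        h_prev, h_curr = h_curr, x * h_curr - n * h_prev
        t_pow_over_fact *= t / (n + 1)    # now t^(n+1) / (n+1)!
        total += h_curr * t_pow_over_fact
    return total
```

For moderate x and t, the partial sum agrees with math.exp(x * t - t * t / 2) to within rounding error.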

Expected value

If X is a random variable with a normal distribution with standard deviation 1 and expected value μ then

E(H_n(X))=\mu^n.\,\! (probabilist)
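This identity can be verified exactly for small n. The sketch below (illustrative names throughout) writes X = μ + Z with Z standard normal, expands E[X^k] with the binomial theorem, and uses the standard moments E[Z^j] = (j − 1)!! for even j and 0 for odd j, which are assumed here rather than derived.

```python
from fractions import Fraction
from math import comb

def hermite_e_coeffs(n):
    """Coefficients of the probabilists' H_n, lowest degree first."""
    prev, curr = [1], [0, 1]
    if n == 0:
        return prev
    for k in range(1, n):
        nxt = [0] + curr                 # x * H_k
        for i, c in enumerate(prev):
            nxt[i] -= k * c              # - k * H_{k-1}
        prev, curr = curr, nxt
    return curr

def standard_moment(j):
    """E[Z^j] for Z ~ N(0, 1)."""
    if j % 2:
        return 0
    result = 1
    for i in range(1, j, 2):
        result *= i
    return result

def shifted_moment(k, mu):
    """E[(mu + Z)^k], expanded with the binomial theorem."""
    return sum(comb(k, j) * mu ** (k - j) * standard_moment(j)
               for j in range(k + 1))

def expected_hermite(n, mu):
    """E[H_n(X)] for X ~ N(mu, 1), computed term by term."""
    return sum(c * shifted_moment(k, mu)
               for k, c in enumerate(hermite_e_coeffs(n)))
```

With mu = Fraction(3, 2), expected_hermite(5, mu) equals mu ** 5 exactly.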

Relations to other functions

Laguerre polynomials

The Hermite polynomials can be expressed as a special case of the Laguerre polynomials.

H_{2n}(x) = (-4)^{n}\,n!\,L_{n}^{(-1/2)}(x^2)=4^n\, n! \sum_{i=0}^n (-1)^{n-i} {n-\frac{1}{2} \choose n-i} \frac{x^{2i}}{i!}\,\! (physicist)
H_{2n+1}(x) = 2(-4)^{n}\,n!\,x\,L_{n}^{(1/2)}(x^2)=2\cdot 4^n\, n! \sum_{i=0}^n (-1)^{n-i} {n+\frac{1}{2} \choose n-i} \frac{x^{2i+1}}{i!}\,\! (physicist)

Relation to confluent hypergeometric functions

The physicists' Hermite polynomials can be expressed as a special case of the parabolic cylinder functions. For Re x > 0,

H_{n}(x) = 2^n\,U\left(-\frac{n}{2},\frac{1}{2};x^2\right) (physicist)

where U(a,b;z) is Tricomi's confluent hypergeometric function. Similarly,

H_{2n}(x) = (-1)^{n}\,\frac{(2n)!}{n!}
\,_1F_1\left(-n,\frac{1}{2};x^2\right) (physicist)
H_{2n+1}(x) = (-1)^{n}\,\frac{(2n+1)!}{n!}\,2x
\,_1F_1\left(-n,\frac{3}{2};x^2\right) (physicist)

where \,_1F_1(a,b;z)=M(a,b;z) is Kummer's confluent hypergeometric function.

Differential operator representation

The probabilists' Hermite polynomials satisfy the identity

H_n(x)=e^{-D^2/2}x^n\,\!

where D represents differentiation with respect to x, and the exponential is interpreted by expanding it as a power series. There are no delicate questions of convergence of this series when it operates on polynomials, since all but finitely many terms vanish.

Since the power series coefficients of the exponential are well known, and higher order derivatives of the monomial x^n can be written down explicitly, this differential operator representation gives rise to a concrete formula for the coefficients of H_n that can be used to quickly compute these polynomials.
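Spelled out, the mth term of the exponential series contributes (−1)^m n! / (2^m m! (n − 2m)!) x^(n − 2m), since D^(2m) x^n = n!/(n − 2m)! x^(n − 2m). A minimal sketch of the resulting coefficient formula, with an illustrative function name:

```python
from math import factorial

def hermite_e_from_operator(n):
    """Coefficients of exp(-D^2/2) x^n, lowest degree first."""
    coeffs = [0] * (n + 1)
    for m in range(n // 2 + 1):
        # (-D^2/2)^m / m! applied to x^n:
        # (-1)^m * n! / (2^m * m! * (n-2m)!) * x^(n-2m); the quotient is an integer.
        magnitude = factorial(n) // (2 ** m * factorial(m) * factorial(n - 2 * m))
        coeffs[n - 2 * m] = (-1) ** m * magnitude
    return coeffs
```

For n = 6 this reproduces x^6 − 15x^4 + 45x^2 − 15.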

Since the formal expression for the Weierstrass transform W is e^{D^2}, we see that the Weierstrass transform of (√2)^n H_n(x/√2) is x^n. Essentially the Weierstrass transform thus turns a series of Hermite polynomials into a corresponding Maclaurin series.

The existence of some formal power series g(D), with nonzero constant coefficient, such that H_n(x) = g(D)x^n, is another equivalent to the statement that these polynomials form an Appell sequence. Since they are an Appell sequence they are a fortiori a Sheffer sequence.

Contour integral representation

The Hermite polynomials have a representation in terms of a contour integral, as

H_n(x)=\frac{n!}{2\pi i}\oint\frac{e^{tx-t^2/2}}{t^{n+1}}\,dt (probabilist)
H_n(x)=\frac{n!}{2\pi i}\oint\frac{e^{2tx-t^2}}{t^{n+1}}\,dt (physicist)

with the contour encircling the origin.

Generalizations

The (probabilists') Hermite polynomials defined above are orthogonal with respect to the standard normal probability distribution, whose density function is

\frac{1}{\sqrt{2\pi}}e^{-x^2/2}\,\!

which has expected value 0 and variance 1. One may speak of Hermite polynomials

H_n^{[\alpha]}(x)\,\!

of variance α, where α is any positive number. These are orthogonal with respect to the normal probability distribution whose density function is

(2\pi\alpha)^{-1/2}e^{-x^2/(2\alpha)}.\,\!

They are given by

H_n^{[\alpha]}(x) = \alpha^{n/2}H_n^{[1]}\left(\frac{x}{\sqrt{\alpha}}\right)=e^{-\alpha D^2/2}x^n.\,\!

In particular, the physicists' Hermite polynomials are, up to the factor 2^n, the Hermite polynomials of variance 1/2:

H_n^\mathrm{phys}(x) = 2^n H_n^{[1/2]}(x).\,\!

If

H_n^{[\alpha]}(x)=\sum_{k=0}^n h^{[\alpha]}_{n,k}x^k\,\!

then the polynomial sequence whose nth term is

\left(H_n^{[\alpha]}\circ H^{[\beta]}\right)(x)=\sum_{k=0}^n h^{[\alpha]}_{n,k}\,H_k^{[\beta]}(x)\,\!

is the umbral composition of the two polynomial sequences, and it can be shown to satisfy the identities

\left(H_n^{[\alpha]}\circ H^{[\beta]}\right)(x)=H_n^{[\alpha+\beta]}(x)\,\!

and

H_n^{[\alpha+\beta]}(x+y)=\sum_{k=0}^n{n\choose k}H_k^{[\alpha]}(x) H_{n-k}^{[\beta]}(y).\,\!

The last identity is expressed by saying that this parameterized family of polynomial sequences is a cross-sequence.
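Both identities can be tested in exact rational arithmetic. The sketch below builds the coefficients of H_n^{[α]} from the operator form e^{−αD^2/2} x^n given above; the helper names are hypothetical and the choice of test values is arbitrary.

```python
from fractions import Fraction
from math import comb, factorial

def hermite_var_coeffs(n, alpha):
    """Coefficients of H_n^{[alpha]} = exp(-alpha D^2 / 2) x^n, lowest degree first."""
    alpha = Fraction(alpha)
    coeffs = [Fraction(0)] * (n + 1)
    for m in range(n // 2 + 1):
        coeffs[n - 2 * m] = ((-alpha / 2) ** m
                             * Fraction(factorial(n), factorial(m) * factorial(n - 2 * m)))
    return coeffs

def evaluate(coeffs, x):
    """Evaluate a polynomial given by its coefficient list at x."""
    return sum(c * x ** k for k, c in enumerate(coeffs))

def umbral_composition(n, alpha, beta):
    """Coefficients of sum_k h^{[alpha]}_{n,k} H_k^{[beta]}(x)."""
    result = [Fraction(0)] * (n + 1)
    for k, h in enumerate(hermite_var_coeffs(n, alpha)):
        for i, c in enumerate(hermite_var_coeffs(k, beta)):
            result[i] += h * c
    return result

def cross_sequence_rhs(n, alpha, beta, x, y):
    """Right-hand side of the cross-sequence identity."""
    return sum(comb(n, k)
               * evaluate(hermite_var_coeffs(k, alpha), x)
               * evaluate(hermite_var_coeffs(n - k, beta), y)
               for k in range(n + 1))
```

For example, umbral_composition(6, Fraction(1, 3), Fraction(1, 2)) coincides with hermite_var_coeffs(6, Fraction(5, 6)).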

"Negative variance"

Since polynomial sequences form a group under the operation of umbral composition, one may denote by

H_n^{[-\alpha]}(x)\,\!

the sequence that is inverse to the one similarly denoted but without the minus sign, and thus speak of Hermite polynomials of negative variance. For α > 0, the coefficients of H_n^{[−α]}(x) are just the absolute values of the corresponding coefficients of H_n^{[α]}(x).

These arise as moments of normal probability distributions: The nth moment of the normal distribution with expected value μ and variance σ^2 is

E(X^n)=H_n^{[-\sigma^2]}(\mu)\,\!

where X is a random variable with the specified normal distribution. A special case of the cross-sequence identity then says that

\sum_{k=0}^n {n\choose k}H_k^{[\alpha]}(x) H_{n-k}^{[-\alpha]}(y)=H_n^{[0]}(x+y)=(x+y)^n.\,\!

Applications

Hermite functions

One can define the Hermite functions from the physicists' polynomials:

\psi_n(x) = (2^n n! \sqrt{\pi})^{-1/2} \mathrm{e}^{-x^2/2} H^\mathrm{phys}_n(x) = (-1)^n (2^n n! \sqrt{\pi})^{-1/2} \mathrm{e}^{x^2/2} \frac{d^n}{dx^n} \mathrm{e}^{-x^2}

Since these functions contain the square root of the weight function, and have been scaled appropriately, they are orthonormal:

\int_{-\infty}^\infty \psi_n(x)\psi_m(x)\, \mathrm{d}x = \delta_{n\,m}\,

and form an orthonormal basis of L^2(R). This fact is equivalent to the corresponding statement for Hermite polynomials (see above).

The Hermite functions are closely related to the Whittaker function (Whittaker and Watson, 1962) D_n(z):

D_n(z) = (n! \sqrt{\pi})^{1/2} \psi_n(z/\sqrt{2}) = (-1)^n \mathrm{e}^{z^2/4} \frac{d^n}{dz^n} \mathrm{e}^{-z^2/2}

and thereby to other parabolic cylinder functions. The Hermite functions satisfy the differential equation:

\psi_n''(x) + (2n + 1 - x^2) \psi_n(x) = 0\,.

This equation is equivalent to the Schrödinger equation for a harmonic oscillator in quantum mechanics, so these functions are the eigenfunctions.

[Figure: Hermite functions 0 (black), 1 (red), 2 (blue), 3 (yellow), 4 (green), and 5 (magenta).]
[Figure: Hermite functions 0 (black), 2 (blue), 4 (green), and 50 (magenta).]

Recursion relation

Following recursion relations of Hermite polynomials, the Hermite functions obey

\psi_n'(x) = \sqrt{\frac{n}{2}}\psi_{n-1}(x) - \sqrt{\frac{n+1}{2}}\psi_{n+1}(x)
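For numerical work it is convenient to evaluate the Hermite functions directly. Substituting the normalization constants into the physicists' recurrence H_{n+1} = 2xH_n − 2nH_{n−1} gives ψ_{n+1}(x) = √(2/(n+1)) x ψ_n(x) − √(n/(n+1)) ψ_{n−1}(x), which avoids the large values of n! and H_n(x). The sketch below uses that derived recurrence (not stated in the text above) and hypothetical function names, and evaluates the right-hand side of the derivative relation.

```python
import math

def hermite_functions(n_max, x):
    """Return [psi_0(x), ..., psi_{n_max}(x)] via the normalized recurrence."""
    psi = [math.pi ** -0.25 * math.exp(-x * x / 2)]      # psi_0
    if n_max >= 1:
        psi.append(math.sqrt(2.0) * x * psi[0])          # psi_1
    for n in range(1, n_max):
        # psi_{n+1} = sqrt(2/(n+1)) x psi_n - sqrt(n/(n+1)) psi_{n-1}
        psi.append(math.sqrt(2.0 / (n + 1)) * x * psi[n]
                   - math.sqrt(n / (n + 1)) * psi[n - 1])
    return psi

def derivative_rhs(n, x):
    """sqrt(n/2) psi_{n-1}(x) - sqrt((n+1)/2) psi_{n+1}(x)."""
    psi = hermite_functions(n + 1, x)
    lower = math.sqrt(n / 2) * psi[n - 1] if n > 0 else 0.0
    return lower - math.sqrt((n + 1) / 2) * psi[n + 1]
```

A central finite difference of hermite_functions(n, x)[n] in x should agree with derivative_rhs(n, x) up to discretization error.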

Cramér's inequality

The Hermite functions satisfy the following bound due to Harald Cramér[1][2]

 |\psi_n(x)| \le K

for x real, where the constant K is less than 1.086435.

Hermite functions as eigenfunctions of the Fourier transform

The Hermite functions ψ_n(x) are a set of eigenfunctions of the continuous Fourier transform. To see this, take the physicists' version of the generating function and multiply by exp(−x^2/2). This gives

\exp (-x^2/2 + 2xt-t^2) = \sum_{n=0}^\infty \exp (-x^2/2) H_n(x) \frac {t^n}{n!}.\,\!

Choosing the unitary representation of the Fourier transform, the Fourier transform of the left hand side is given by


\begin{align}
\mathcal{F} \{ \exp (-x^2/2 + 2xt-t^2)\}(k) & {} =
\frac{1}{\sqrt{2 \pi}}\int_{-\infty}^\infty \exp (-ixk)\exp (-x^2/2 + 2xt-t^2)\, \mathrm{d}x \\
& {} = \exp (-k^2/2 - 2kit+t^2) \\
& {} = \sum_{n=0}^\infty \exp (-k^2/2) H_n(k) \frac {(-it)^n}{n!}.
\end{align}

The Fourier transform of the right hand side is given by

\mathcal{F} \left\{ \sum_{n=0}^\infty \exp (-x^2/2) H_n(x) \frac {t^n}{n!} \right\} = \sum_{n=0}^\infty \mathcal{F} \left \{ \exp(-x^2/2) H_n(x) \right\} \frac{t^n}{n!}. \,

Equating like powers of t in the transformed versions of the left- and right-hand sides gives

 \mathcal{F} \left\{ \exp (-x^2/2) H_n(x) \right\} = (-i)^n \exp (-k^2/2) H_n(k). \,\!

The Hermite functions ψ_n(x) are therefore an orthonormal basis of L^2(R) which diagonalizes the Fourier transform operator. In this case, we chose the unitary version of the Fourier transform, so the eigenvalues are (−i)^n.
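The eigenvalue relation can be checked numerically with a plain trapezoidal rule, which is very accurate for such rapidly decaying integrands. This is a sketch under stated assumptions: the integration window [−12, 12], the step count, and the function names are arbitrary choices, and ψ_n is evaluated with the normalized recurrence ψ_{m+1} = √(2/(m+1)) x ψ_m − √(m/(m+1)) ψ_{m−1}, which follows from the physicists' three-term recurrence.

```python
import cmath
import math

def hermite_function(n, x):
    """psi_n(x) via the normalized three-term recurrence."""
    prev = math.pi ** -0.25 * math.exp(-x * x / 2)       # psi_0
    if n == 0:
        return prev
    curr = math.sqrt(2.0) * x * prev                     # psi_1
    for m in range(1, n):
        prev, curr = curr, (math.sqrt(2.0 / (m + 1)) * x * curr
                            - math.sqrt(m / (m + 1)) * prev)
    return curr

def fourier_transform_psi(n, k, half_width=12.0, steps=4800):
    """Approximate (1/sqrt(2 pi)) * integral of psi_n(x) exp(-i k x) dx."""
    h = 2 * half_width / steps
    total = 0j
    for j in range(steps + 1):
        x = -half_width + j * h
        weight = 0.5 if j in (0, steps) else 1.0         # trapezoidal end weights
        total += weight * hermite_function(n, x) * cmath.exp(-1j * k * x)
    return total * h / math.sqrt(2 * math.pi)
```

The value fourier_transform_psi(n, k) should then be close to (-1j) ** n * hermite_function(n, k).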

Combinatorial interpretation of coefficients

In the Hermite polynomial H_n(x) of variance 1, the absolute value of the coefficient of x^k is the number of (unordered) partitions of an n-member set into k singletons and (n − k)/2 (unordered) pairs.
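The count can be computed independently of the polynomials by a simple recursion: a distinguished element is either a singleton or is paired with one of the other n − 1 elements. The sketch below uses an illustrative function name and can be compared with the absolute values of the coefficients listed in the Definition section.

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def count_partitions(n, k):
    """Number of partitions of an n-set into k singletons and (n-k)/2 pairs."""
    if n < 0 or k < 0 or k > n or (n - k) % 2:
        return 0
    if n == 0:
        return 1                          # the empty partition (k is 0 here)
    # The last element is a singleton, or is paired with one of the n-1 others.
    return count_partitions(n - 1, k - 1) + (n - 1) * count_partitions(n - 2, k)
```

For n = 6 the counts for k = 0, ..., 6 are 15, 0, 45, 0, 15, 0, 1, the absolute values of the coefficients of x^6 − 15x^4 + 45x^2 − 15.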

Completeness relation

The following identity holds in the sense of distributions[3]

\sum_{n=0}^\infty \psi_n (x) \psi_n (y)= \delta(x-y),

where δ is the Dirac delta function, (ψ_n) the Hermite functions, and δ(x − y) represents the Lebesgue measure on the line y = x in R^2, normalized so that its projection on the horizontal axis is the usual Lebesgue measure. This distributional identity follows by letting u → 1 in Mehler's formula, valid when −1 < u < 1:

E(x, y; u) := \sum_{n=0}^\infty u^n \, \psi_n (x) \, \psi_n (y) = \frac 1 {\sqrt{\pi (1 - u^2)}} \, \mathrm{exp} \left( - \frac{1 - u}{1 + u} \, \frac{(x + y)^2}{4} \,-\, \frac{1 + u}{1 - u} \, \frac{(x - y)^2}{4}\right).

The function (x, y) → E(x, y; u) is the density for a Gaussian measure on R^2 which is, when u is close to 1, very concentrated around the line y = x, and very spread out on that line. It follows that

 \left\langle \left( \sum_{n=0}^\infty u^n \langle f, \psi_n \rangle \psi_n\right), g \right\rangle = \int\!\!\int E(x, y; u) f(x) \overline{g(y)} \, \mathrm{d}x \, \mathrm{d}y \rightarrow \int f(x) \overline{g(x)} \, \mathrm{d} x = \langle f, g \rangle,

when ƒ, g are continuous and compactly supported. This yields that ƒ can be expressed from the Hermite functions, as the sum of a series of vectors in L^2(R), namely

 f = \sum_{n=0}^\infty \langle f, \psi_n \rangle \psi_n.

In order to prove the equality above for E(x, y; u), the Fourier transform of Gaussian functions will be used several times,

 \rho \sqrt{\pi} \, \mathrm{e}^{-\rho^2 x^2 / 4} = \int \mathrm{e}^{isx- s^2/\rho^2}\, \mathrm{d}s, \quad \rho > 0.

The Hermite polynomial is then represented as

 H_n(x) = (-1)^{n} \mathrm{e}^{x^2} \frac {\mathrm{d}^n}{\mathrm{d}x^n} \Bigl( \frac {1}{2\sqrt{\pi}} \int \mathrm{e}^{isx - s^2/4}\, \mathrm{d}s \Bigr) = (-1)^n \mathrm{e}^{x^2}\frac {1}{2\sqrt{\pi}}\int (is)^n \, \mathrm{e}^{isx-  s^2/4}\, \mathrm{d}s.

With this representation for Hn(x) and Hn(y), one sees that


\begin{align}E(x, y; u) &= \sum_{n=0}^\infty \frac{u^n}{2^n n! \sqrt{\pi}} \, H_n(x) H_n(y) \, \mathrm{e}^{ - (x^2+y^2)/2} \\
& =\frac{\mathrm{e}^{(x^2+y^2)/2}}{4\pi\sqrt{\pi}}\int \!\! \int \Bigl( \sum_{n=0}^\infty \frac{1}{2^n n! } (-ust)^n \Bigr) \,  \mathrm{e}^{isx+ity - s^2/4 - t^2/4}\, \mathrm{d}s\,\mathrm{d}t \\
& =\frac{\mathrm{e}^{(x^2+y^2)/2} }{4\pi\sqrt{\pi}}\int \!\! \int \mathrm{e}^{-ust/2} \, \mathrm{e}^{isx+ity - s^2/4 - t^2/4}\, \mathrm{d}s\,\mathrm{d}t,\end{align}

and this implies the desired result, using again the Fourier transform of Gaussian kernels after performing the substitution

s = \frac{\sigma + \tau}{\sqrt 2},\qquad\qquad t = \frac{\sigma - \tau}{\sqrt 2}.
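Independently of the proof, Mehler's formula is easy to test numerically by comparing a partial sum of the series with the closed form. The following is a floating-point sketch with illustrative names; the truncation at 80 terms is an arbitrary choice that is ample for |u| well below 1, and ψ_n is evaluated with the normalized recurrence ψ_{n+1} = √(2/(n+1)) x ψ_n − √(n/(n+1)) ψ_{n−1} derived from the physicists' recurrence.

```python
import math

def hermite_functions(n_max, x):
    """Return [psi_0(x), ..., psi_{n_max}(x)] via the normalized recurrence."""
    psi = [math.pi ** -0.25 * math.exp(-x * x / 2)]
    if n_max >= 1:
        psi.append(math.sqrt(2.0) * x * psi[0])
    for n in range(1, n_max):
        psi.append(math.sqrt(2.0 / (n + 1)) * x * psi[n]
                   - math.sqrt(n / (n + 1)) * psi[n - 1])
    return psi

def mehler_closed_form(x, y, u):
    """Right-hand side of Mehler's formula, valid for -1 < u < 1."""
    exponent = (-(1 - u) / (1 + u) * (x + y) ** 2 / 4
                - (1 + u) / (1 - u) * (x - y) ** 2 / 4)
    return math.exp(exponent) / math.sqrt(math.pi * (1 - u * u))

def mehler_partial_sum(x, y, u, terms=80):
    """Sum of u^n psi_n(x) psi_n(y) for n < terms."""
    px = hermite_functions(terms - 1, x)
    py = hermite_functions(terms - 1, y)
    return sum(u ** n * px[n] * py[n] for n in range(terms))
```

At x = 0.3, y = −0.7, u = 0.5 the two sides agree to within rounding error.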

Notes

References
